Overview
Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 999 |
| Missing cells | 452 |
| Missing cells (%) | 2.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 848.2 KiB |
| Average record size in memory | 869.4 B |
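The two derived figures in this table can be reproduced from the raw counts. The sketch below assumes the missing-cell percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes; the variable names are illustrative only.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 21
n_observations = 999
missing_cells = 452
total_size_kib = 848.2

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 KiB = 1024 bytes.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the results round to the 2.2% and 869.4 B reported above.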
Variable types
| Numeric | 12 |
|---|---|
| Text | 8 |
| Categorical | 1 |
Alerts
| Alert | Type |
|---|---|
| Rating is highly overall correlated with Unnamed: 0 and 1 other fields | High correlation |
| Revenue is highly overall correlated with Votes and 4 other fields | High correlation |
| Unnamed: 0 is highly overall correlated with Rating and 1 other fields | High correlation |
| Unnamed: 0.1 is highly overall correlated with Rating and 1 other fields | High correlation |
| Votes is highly overall correlated with Revenue and 4 other fields | High correlation |
| tmdb_budget is highly overall correlated with Revenue and 4 other fields | High correlation |
| tmdb_popularity is highly overall correlated with Revenue and 4 other fields | High correlation |
| tmdb_revenue is highly overall correlated with Revenue and 4 other fields | High correlation |
| tmdb_vote_count is highly overall correlated with Revenue and 4 other fields | High correlation |
| Certificate has 101 (10.1%) missing values | Missing |
| scoreAvg has 157 (15.7%) missing values | Missing |
| Revenue has 169 (16.9%) missing values | Missing |
| Unnamed: 0.1 is uniformly distributed | Uniform |
| Unnamed: 0 is uniformly distributed | Uniform |
| Unnamed: 0.1 has unique values | Unique |
| Unnamed: 0 has unique values | Unique |
| Overview has unique values | Unique |
| tmdb_budget has 150 (15.0%) zeros | Zeros |
| tmdb_revenue has 112 (11.2%) zeros | Zeros |
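The Uniform and Unique alerts on Unnamed: 0.1 and Unnamed: 0 are what one would expect from leftover row-index columns. That reading is an assumption, not something the report states. A minimal check for such a column might look like the sketch below; `looks_like_row_index` is a hypothetical helper, and the column contents are assumed to be the integers 0..998 and 1..999.

```python
def looks_like_row_index(values):
    """Return True if the values increase by exactly 1 at every step."""
    values = list(values)
    return len(values) > 1 and all(b - a == 1 for a, b in zip(values, values[1:]))

# Assumption: contents inferred from the reported min, max and distinct counts.
unnamed_0_1 = list(range(0, 999))
unnamed_0 = list(range(1, 1000))
not_an_index = [3, 1, 2]  # made-up counter-example

print(looks_like_row_index(unnamed_0_1))
print(looks_like_row_index(unnamed_0))
print(looks_like_row_index(not_an_index))
```

If the check holds on the real data, those columns carry no information beyond row order, and their reported correlation with Rating would then reflect only that the rows are sorted by rating.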
Reproduction
| Analysis started | 2025-09-02 13:55:35.368843 |
|---|---|
| Analysis finished | 2025-09-02 13:56:58.479785 |
| Duration | 1 minute and 23.11 seconds |
| Software version | ydata-profiling v4.16.1 |
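The reported duration follows from the two timestamps. The sketch below recomputes it with the standard library; the format string is an assumption based on how the timestamps are printed here.

```python
from datetime import datetime

# Assumption: timestamps are printed as "YYYY-MM-DD HH:MM:SS.ffffff".
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2025-09-02 13:55:35.368843", FMT)
finished = datetime.strptime("2025-09-02 13:56:58.479785", FMT)

elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute and {seconds:.2f} seconds")
```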
Variables
Unnamed: 0.1
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 999 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 499 |
| Minimum | 0 |
| Maximum | 998 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 49.9 |
| Q1 | 249.5 |
| median | 499 |
| Q3 | 748.5 |
| 95-th percentile | 948.1 |
| Maximum | 998 |
| Range | 998 |
| Interquartile range (IQR) | 499 |
Descriptive statistics
| Standard deviation | 288.53076 |
|---|---|
| Coefficient of variation (CV) | 0.57821796 |
| Kurtosis | -1.2 |
| Mean | 499 |
| Median Absolute Deviation (MAD) | 250 |
| Skewness | 0 |
| Sum | 498501 |
| Variance | 83250 |
| Monotonicity | Strictly increasing |
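These descriptive statistics are consistent with a column holding the integers 0 to 998, one per row. The sketch below assumes exactly that, and that the reported variance uses the sample (n - 1) denominator. Neither is stated in the report.

```python
import statistics

# Assumption: the column holds every integer from 0 to 998 exactly once.
values = list(range(999))

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation

print(mean, variance, round(std, 5), round(cv, 8))
```

With the n - 1 denominator the variance of 0..998 is 999 × 1000 / 12 = 83250, which matches the table. The population variance would be about 83166.67 instead.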
| Value | Count | Frequency (%) |
| 998 | 1 | 0.1% |
| 0 | 1 | 0.1% |
| 1 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 3 | 1 | 0.1% |
| 4 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| 982 | 1 | 0.1% |
| 981 | 1 | 0.1% |
| 980 | 1 | 0.1% |
| Other values (989) | 989 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 |
| Value | Count | Frequency (%) |
| 998 | 1 | |
| 997 | 1 | |
| 996 | 1 | |
| 995 | 1 | |
| 994 | 1 | |
| 993 | 1 | |
| 992 | 1 | |
| 991 | 1 | |
| 990 | 1 | |
| 989 | 1 |
Unnamed: 0
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 999 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 500 |
| Minimum | 1 |
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 50.9 |
| Q1 | 250.5 |
| median | 500 |
| Q3 | 749.5 |
| 95-th percentile | 949.1 |
| Maximum | 999 |
| Range | 998 |
| Interquartile range (IQR) | 499 |
Descriptive statistics
| Standard deviation | 288.53076 |
|---|---|
| Coefficient of variation (CV) | 0.57706152 |
| Kurtosis | -1.2 |
| Mean | 500 |
| Median Absolute Deviation (MAD) | 250 |
| Skewness | 0 |
| Sum | 499500 |
| Variance | 83250 |
| Monotonicity | Strictly increasing |
| Value | Count | Frequency (%) |
| 999 | 1 | 0.1% |
| 1 | 1 | 0.1% |
| 2 | 1 | 0.1% |
| 3 | 1 | 0.1% |
| 4 | 1 | 0.1% |
| 5 | 1 | 0.1% |
| 6 | 1 | 0.1% |
| 983 | 1 | 0.1% |
| 982 | 1 | 0.1% |
| 981 | 1 | 0.1% |
| Other values (989) | 989 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 999 | 1 | |
| 998 | 1 | |
| 997 | 1 | |
| 996 | 1 | |
| 995 | 1 | |
| 994 | 1 | |
| 993 | 1 | |
| 992 | 1 | |
| 991 | 1 | |
| 990 | 1 |
Title
Text
| Distinct | 998 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 73.0 KiB |
Length
| Max length | 68 |
|---|---|
| Median length | 41 |
| Mean length | 15.443443 |
| Min length | 2 |
Unique
| Unique | 997 |
|---|---|
| Unique (%) | 99.8% |
Sample
| 1st row | The Godfather |
|---|---|
| 2nd row | The Dark Knight |
| 3rd row | The Godfather: Part II |
| 4th row | 12 Angry Men |
| 5th row | The Lord of the Rings: The Return of the King |
| Value | Count | Frequency (%) |
| the | 274 | 9.8% |
| of | 86 | 3.1% |
| a | 32 | 1.2% |
| and | 28 | 1.0% |
| no | 24 | 0.9% |
| la | 23 | 0.8% |
| in | 22 | 0.8% |
| to | 18 | 0.6% |
| de | 17 | 0.6% |
| man | 17 | 0.6% |
| Other values (1664) | 2241 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1783 | 11.6% |
| e | 1425 | 9.2% |
| a | 1126 | 7.3% |
| o | 965 | 6.3% |
| n | 921 | 6.0% |
| i | 861 | 5.6% |
| r | 816 | 5.3% |
| t | 755 | 4.9% |
| h | 564 | 3.7% |
| s | 562 | 3.6% |
| Other values (90) | 5650 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11162 | |
| Uppercase Letter | 2191 | 14.2% |
| Space Separator | 1783 | 11.6% |
| Other Punctuation | 177 | 1.1% |
| Decimal Number | 79 | 0.5% |
| Dash Punctuation | 31 | 0.2% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Other Number | 1 | < 0.1% |
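The category labels in this table correspond to Unicode general categories. As a rough illustration, the sketch below classifies the characters of one sample title with Python's standard `unicodedata` module. This is not necessarily how the profiler computes its counts, and `CATEGORY_NAMES` and `category_counts` are hypothetical names introduced only for this example.

```python
import unicodedata
from collections import Counter

# Standard two-letter Unicode category codes mapped to the labels used in the report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "No": "Other Number",
}

def category_counts(text):
    """Count the characters of text by Unicode general category code."""
    return Counter(unicodedata.category(ch) for ch in text)

counts = category_counts("The Godfather: Part II")
for code, n in counts.most_common():
    print(CATEGORY_NAMES.get(code, code), n)
```

The single "Other Number" entry in the table is the character ½, which `unicodedata.category` reports as `No`.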
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1425 | |
| a | 1126 | |
| o | 965 | 8.6% |
| n | 921 | 8.3% |
| i | 861 | 7.7% |
| r | 816 | 7.3% |
| t | 755 | 6.8% |
| h | 564 | 5.1% |
| s | 562 | 5.0% |
| l | 514 | 4.6% |
| Other values (38) | 2653 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 283 | 12.9% |
| S | 187 | 8.5% |
| B | 157 | 7.2% |
| M | 139 | 6.3% |
| D | 129 | 5.9% |
| L | 119 | 5.4% |
| A | 113 | 5.2% |
| C | 101 | 4.6% |
| H | 98 | 4.5% |
| P | 97 | 4.4% |
| Other values (18) | 768 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 23 | |
| 1 | 15 | |
| 0 | 11 | |
| 3 | 8 | 10.1% |
| 4 | 5 | 6.3% |
| 7 | 5 | 6.3% |
| 9 | 4 | 5.1% |
| 5 | 4 | 5.1% |
| 8 | 2 | 2.5% |
| 6 | 2 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 62 | |
| . | 47 | |
| ' | 32 | |
| , | 16 | 9.0% |
| ! | 7 | 4.0% |
| & | 6 | 3.4% |
| ? | 3 | 1.7% |
| / | 3 | 1.7% |
| · | 1 | 0.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1783 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 31 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13353 | |
| Common | 2075 | 13.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1425 | 10.7% |
| a | 1126 | 8.4% |
| o | 965 | 7.2% |
| n | 921 | 6.9% |
| i | 861 | 6.4% |
| r | 816 | 6.1% |
| t | 755 | 5.7% |
| h | 564 | 4.2% |
| s | 562 | 4.2% |
| l | 514 | 3.8% |
| Other values (66) | 4844 |
Common
| Value | Count | Frequency (%) |
| (space) | 1783 | |
| : | 62 | 3.0% |
| . | 47 | 2.3% |
| ' | 32 | 1.5% |
| - | 31 | 1.5% |
| 2 | 23 | 1.1% |
| , | 16 | 0.8% |
| 1 | 15 | 0.7% |
| 0 | 11 | 0.5% |
| 3 | 8 | 0.4% |
| Other values (14) | 47 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15362 | |
| None | 66 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1783 | 11.6% |
| e | 1425 | 9.3% |
| a | 1126 | 7.3% |
| o | 965 | 6.3% |
| n | 921 | 6.0% |
| i | 861 | 5.6% |
| r | 816 | 5.3% |
| t | 755 | 4.9% |
| h | 564 | 3.7% |
| s | 562 | 3.7% |
| Other values (64) | 5584 |
None
| Value | Count | Frequency (%) |
| ô | 14 | |
| é | 6 | 9.1% |
| û | 5 | 7.6% |
| è | 5 | 7.6% |
| â | 5 | 7.6% |
| ä | 4 | 6.1% |
| î | 2 | 3.0% |
| ù | 2 | 3.0% |
| ü | 2 | 3.0% |
| á | 2 | 3.0% |
| Other values (16) | 19 |
Year
Real number (ℝ)
| Distinct | 99 |
|---|---|
| Distinct (%) | 9.9% |
| Missing | 1 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1991.2144 |
| Minimum | 1920 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1920 |
|---|---|
| 5-th percentile | 1944 |
| Q1 | 1976 |
| median | 1999 |
| Q3 | 2009 |
| 95-th percentile | 2017 |
| Maximum | 2020 |
| Range | 100 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 23.308539 |
|---|---|
| Coefficient of variation (CV) | 0.01170569 |
| Kurtosis | -0.02478235 |
| Mean | 1991.2144 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | -0.93854006 |
| Sum | 1987232 |
| Variance | 543.28798 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2014 | 32 | 3.2% |
| 2004 | 31 | 3.1% |
| 2009 | 29 | 2.9% |
| 2013 | 28 | 2.8% |
| 2016 | 28 | 2.8% |
| 2001 | 27 | 2.7% |
| 2006 | 26 | 2.6% |
| 2007 | 26 | 2.6% |
| 2015 | 25 | 2.5% |
| 2012 | 24 | 2.4% |
| Other values (89) | 722 |
| Value | Count | Frequency (%) |
| 1920 | 1 | 0.1% |
| 1921 | 1 | 0.1% |
| 1922 | 1 | 0.1% |
| 1924 | 1 | 0.1% |
| 1925 | 2 | |
| 1926 | 1 | 0.1% |
| 1927 | 2 | |
| 1928 | 2 | |
| 1930 | 1 | 0.1% |
| 1931 | 3 |
| Value | Count | Frequency (%) |
| 2020 | 6 | 0.6% |
| 2019 | 23 | |
| 2018 | 19 | |
| 2017 | 22 | |
| 2016 | 28 | |
| 2015 | 25 | |
| 2014 | 32 | |
| 2013 | 28 | |
| 2012 | 24 | |
| 2011 | 18 |
Certificate
Categorical
Missing 
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 101 |
| Missing (%) | 10.1% |
| Memory size | 58.0 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 1 |
| Mean length | 1.7371938 |
| Min length | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | A |
|---|---|
| 2nd row | UA |
| 3rd row | A |
| 4th row | U |
| 5th row | U |
Common Values
| Value | Count | Frequency (%) |
| U | 234 | |
| A | 196 | |
| UA | 175 | |
| R | 146 | |
| PG-13 | 43 | 4.3% |
| PG | 37 | 3.7% |
| Passed | 34 | 3.4% |
| G | 12 | 1.2% |
| Approved | 11 | 1.1% |
| TV-PG | 3 | 0.3% |
| Other values (6) | 7 | 0.7% |
| (Missing) | 101 |
Length
| Value | Count | Frequency (%) |
| u | 234 | |
| a | 196 | |
| ua | 175 | |
| r | 146 | |
| pg-13 | 43 | 4.8% |
| pg | 37 | 4.1% |
| passed | 34 | 3.8% |
| g | 12 | 1.3% |
| approved | 11 | 1.2% |
| tv-pg | 3 | 0.3% |
| Other values (6) | 7 | 0.8% |
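The same count appears with two different percentages in the two Certificate tables (for example PG-13: 4.3% and 4.8%). The numbers are consistent with one table dividing by all rows and the other by non-missing rows only. This is inferred from the figures, not documented in the report. A quick check:

```python
# Figures copied from the Certificate section.
n_rows = 999
n_missing = 101
n_present = n_rows - n_missing  # rows with a certificate value

pg13_count = 43
n_distinct = 16

pct_of_all_rows = 100 * pg13_count / n_rows     # denominator includes missing rows
pct_of_present = 100 * pg13_count / n_present   # denominator excludes missing rows
distinct_pct = 100 * n_distinct / n_present     # "Distinct (%)" under the same assumption

print(f"{pct_of_all_rows:.1f}% {pct_of_present:.1f}% {distinct_pct:.1f}%")
```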
Most occurring characters
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 9.4% |
| P | 119 | 7.6% |
| G | 97 | 6.2% |
| s | 68 | 4.4% |
| - | 48 | 3.1% |
| e | 46 | 2.9% |
| d | 46 | 2.9% |
| 1 | 45 | 2.9% |
| Other values (14) | 150 | 9.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1168 | |
| Lowercase Letter | 253 | 16.2% |
| Decimal Number | 90 | 5.8% |
| Dash Punctuation | 48 | 3.1% |
| Other Punctuation | 1 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 68 | |
| e | 46 | |
| d | 46 | |
| a | 35 | |
| p | 22 | 8.7% |
| r | 12 | 4.7% |
| o | 11 | 4.3% |
| v | 11 | 4.3% |
| n | 1 | 0.4% |
| t | 1 | 0.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 12.5% |
| P | 119 | 10.2% |
| G | 97 | 8.3% |
| T | 5 | 0.4% |
| V | 5 | 0.4% |
| M | 1 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 45 | |
| 3 | 43 | |
| 4 | 1 | 1.1% |
| 6 | 1 | 1.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 48 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1421 | |
| Common | 139 | 8.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 10.3% |
| P | 119 | 8.4% |
| G | 97 | 6.8% |
| s | 68 | 4.8% |
| e | 46 | 3.2% |
| d | 46 | 3.2% |
| a | 35 | 2.5% |
| p | 22 | 1.5% |
| Other values (8) | 47 | 3.3% |
Common
| Value | Count | Frequency (%) |
| - | 48 | |
| 1 | 45 | |
| 3 | 43 | |
| 4 | 1 | 0.7% |
| 6 | 1 | 0.7% |
| / | 1 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1560 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 411 | |
| A | 384 | |
| R | 146 | 9.4% |
| P | 119 | 7.6% |
| G | 97 | 6.2% |
| s | 68 | 4.4% |
| - | 48 | 3.1% |
| e | 46 | 2.9% |
| d | 46 | 2.9% |
| 1 | 45 | 2.9% |
| Other values (14) | 150 | 9.6% |
Runtime
Real number (ℝ)
| Distinct | 140 |
|---|---|
| Distinct (%) | 14.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 122.87187 |
| Minimum | 45 |
| Maximum | 321 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 45 |
|---|---|
| 5-th percentile | 87 |
| Q1 | 103 |
| median | 119 |
| Q3 | 137 |
| 95-th percentile | 178 |
| Maximum | 321 |
| Range | 276 |
| Interquartile range (IQR) | 34 |
Descriptive statistics
| Standard deviation | 28.101227 |
|---|---|
| Coefficient of variation (CV) | 0.2287035 |
| Kurtosis | 3.4289066 |
| Mean | 122.87187 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 1.2098771 |
| Sum | 122749 |
| Variance | 789.67896 |
| Monotonicity | Not monotonic |
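One common way to read the quartiles together with the positive skewness is the 1.5 × IQR rule (Tukey fences). The report itself does not apply this rule; the sketch below is only an illustration using the quartiles from the table.

```python
# Quartiles copied from the Runtime quantile statistics.
q1, q3 = 103, 137
iqr = q3 - q1

# Tukey fences: values outside [lower, upper] are often treated as outliers.
lower = q1 - 1.5 * iqr
upper = q3 + 1.5 * iqr

reported_min, reported_max = 45, 321
print(lower, upper)
print(reported_min < lower, reported_max > upper)
```

By this rule both the reported minimum (45) and maximum (321) fall outside the fences, as do all ten of the largest runtimes listed for this variable.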
| Value | Count | Frequency (%) |
| 100 | 23 | 2.3% |
| 130 | 23 | 2.3% |
| 129 | 22 | 2.2% |
| 101 | 22 | 2.2% |
| 113 | 22 | 2.2% |
| 110 | 20 | 2.0% |
| 122 | 20 | 2.0% |
| 108 | 19 | 1.9% |
| 102 | 18 | 1.8% |
| 96 | 17 | 1.7% |
| Other values (130) | 793 |
| Value | Count | Frequency (%) |
| 45 | 1 | 0.1% |
| 64 | 1 | 0.1% |
| 67 | 1 | 0.1% |
| 68 | 1 | 0.1% |
| 69 | 1 | 0.1% |
| 70 | 1 | 0.1% |
| 71 | 2 | |
| 72 | 2 | |
| 75 | 2 | |
| 76 | 3 |
| Value | Count | Frequency (%) |
| 321 | 1 | |
| 242 | 1 | |
| 238 | 1 | |
| 229 | 1 | |
| 228 | 1 | |
| 224 | 1 | |
| 220 | 1 | |
| 212 | 1 | |
| 210 | 1 | |
| 209 | 1 |
Genre
Text
| Distinct | 202 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.3 KiB |
Length
| Max length | 29 |
|---|---|
| Median length | 24 |
| Mean length | 19.077077 |
| Min length | 5 |
Unique
| Unique | 72 |
|---|---|
| Unique (%) | 7.2% |
Sample
| 1st row | Crime, Drama |
|---|---|
| 2nd row | Action, Crime, Drama |
| 3rd row | Crime, Drama |
| 4th row | Crime, Drama |
| 5th row | Action, Adventure, Drama |
| Value | Count | Frequency (%) |
| drama | 723 | |
| comedy | 233 | 9.2% |
| crime | 209 | 8.2% |
| adventure | 196 | 7.7% |
| action | 189 | 7.4% |
| thriller | 137 | 5.4% |
| romance | 125 | 4.9% |
| biography | 109 | 4.3% |
| mystery | 99 | 3.9% |
| animation | 82 | 3.2% |
| Other values (11) | 438 |
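The word table counts individual genres rather than whole Genre strings. A minimal sketch of that kind of count is shown below, assuming genres are separated by commas and compared case-insensitively. The profiler's own tokenisation may differ (for instance for hyphenated genres), and `genre_counts` is a hypothetical helper.

```python
from collections import Counter

# The five sample rows shown for the Genre variable.
sample_rows = [
    "Crime, Drama",
    "Action, Crime, Drama",
    "Crime, Drama",
    "Crime, Drama",
    "Action, Adventure, Drama",
]

def genre_counts(rows):
    """Count lowercase genre tokens across comma-separated genre strings."""
    counts = Counter()
    for row in rows:
        counts.update(token.strip().lower() for token in row.split(","))
    return counts

counts = genre_counts(sample_rows)
print(counts.most_common())
```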
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2018 | 10.6% |
| r | 1871 | 9.8% |
| , | 1541 | 8.1% |
| (space) | 1541 | 8.1% |
| m | 1447 | 7.6% |
| e | 1235 | 6.5% |
| i | 1144 | 6.0% |
| o | 896 | 4.7% |
| n | 760 | 4.0% |
| t | 727 | 3.8% |
| Other values (23) | 5878 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13264 | |
| Uppercase Letter | 2626 | 13.8% |
| Other Punctuation | 1541 | 8.1% |
| Space Separator | 1541 | 8.1% |
| Dash Punctuation | 86 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2018 | |
| r | 1871 | |
| m | 1447 | |
| e | 1235 | |
| i | 1144 | |
| o | 896 | |
| n | 760 | 5.7% |
| t | 727 | 5.5% |
| y | 718 | 5.4% |
| c | 433 | 3.3% |
| Other values (8) | 2015 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 723 | |
| A | 467 | |
| C | 442 | |
| F | 208 | 7.9% |
| M | 151 | 5.8% |
| T | 137 | 5.2% |
| R | 125 | 4.8% |
| B | 109 | 4.2% |
| H | 88 | 3.4% |
| S | 86 | 3.3% |
| Other values (2) | 90 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1541 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1541 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15890 | |
| Common | 3168 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2018 | |
| r | 1871 | |
| m | 1447 | 9.1% |
| e | 1235 | 7.8% |
| i | 1144 | 7.2% |
| o | 896 | 5.6% |
| n | 760 | 4.8% |
| t | 727 | 4.6% |
| D | 723 | 4.6% |
| y | 718 | 4.5% |
| Other values (20) | 4351 |
Common
| Value | Count | Frequency (%) |
| , | 1541 | |
| (space) | 1541 | |
| - | 86 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19058 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2018 | 10.6% |
| r | 1871 | 9.8% |
| , | 1541 | 8.1% |
| (space) | 1541 | 8.1% |
| m | 1447 | 7.6% |
| e | 1235 | 6.5% |
| i | 1144 | 6.0% |
| o | 896 | 4.7% |
| n | 760 | 4.0% |
| t | 727 | 3.8% |
| Other values (23) | 5878 |
Rating
Real number (ℝ)
High correlation 
| Distinct | 16 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.9479479 |
| Minimum | 7.6 |
| Maximum | 9.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 7.6 |
|---|---|
| 5-th percentile | 7.6 |
| Q1 | 7.7 |
| median | 7.9 |
| Q3 | 8.1 |
| 95-th percentile | 8.5 |
| Maximum | 9.2 |
| Range | 1.6 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.27228951 |
|---|---|
| Coefficient of variation (CV) | 0.034259096 |
| Kurtosis | 1.0583968 |
| Mean | 7.9479479 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 0.94669269 |
| Sum | 7940 |
| Variance | 0.074141576 |
| Monotonicity | Decreasing |
| Value | Count | Frequency (%) |
| 7.7 | 157 | |
| 7.8 | 151 | |
| 8 | 141 | |
| 8.1 | 127 | |
| 7.6 | 123 | |
| 7.9 | 106 | |
| 8.2 | 67 | |
| 8.3 | 44 | 4.4% |
| 8.4 | 31 | 3.1% |
| 8.5 | 20 | 2.0% |
| Other values (6) | 32 | 3.2% |
| Value | Count | Frequency (%) |
| 7.6 | 123 | |
| 7.7 | 157 | |
| 7.8 | 151 | |
| 7.9 | 106 | |
| 8 | 141 | |
| 8.1 | 127 | |
| 8.2 | 67 | |
| 8.3 | 44 | 4.4% |
| 8.4 | 31 | 3.1% |
| 8.5 | 20 | 2.0% |
| Value | Count | Frequency (%) |
| 9.2 | 1 | 0.1% |
| 9 | 3 | 0.3% |
| 8.9 | 3 | 0.3% |
| 8.8 | 5 | 0.5% |
| 8.7 | 5 | 0.5% |
| 8.6 | 15 | 1.5% |
| 8.5 | 20 | 2.0% |
| 8.4 | 31 | |
| 8.3 | 44 | |
| 8.2 | 67 |
Overview
Text
Unique 
| Distinct | 999 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 201.4 KiB |
Length
| Max length | 313 |
|---|---|
| Median length | 197 |
| Mean length | 146.28328 |
| Min length | 40 |
Unique
| Unique | 999 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | An organized crime dynasty's aging patriarch transfers control of his clandestine empire to his reluctant son. |
|---|---|
| 2nd row | When the menace known as the Joker wreaks havoc and chaos on the people of Gotham, Batman must accept one of the greatest psychological and physical tests of his ability to fight injustice. |
| 3rd row | The early life and career of Vito Corleone in 1920s New York City is portrayed, while his son, Michael, expands and tightens his grip on the family crime syndicate. |
| 4th row | A jury holdout attempts to prevent a miscarriage of justice by forcing his colleagues to reconsider the evidence. |
| 5th row | Gandalf and Aragorn lead the World of Men against Sauron's army to draw his gaze from Frodo and Sam as they approach Mount Doom with the One Ring. |
| Value | Count | Frequency (%) |
| a | 1609 | 6.4% |
| the | 1206 | 4.8% |
| to | 803 | 3.2% |
| of | 777 | 3.1% |
| and | 696 | 2.8% |
| in | 565 | 2.3% |
| his | 516 | 2.1% |
| an | 291 | 1.2% |
| is | 245 | 1.0% |
| with | 242 | 1.0% |
| Other values (5878) | 18034 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 23999 | |
| e | 13867 | 9.5% |
| a | 9800 | 6.7% |
| t | 9329 | 6.4% |
| i | 8842 | 6.1% |
| n | 8580 | 5.9% |
| o | 8559 | 5.9% |
| r | 8202 | 5.6% |
| s | 7965 | 5.5% |
| h | 5625 | 3.8% |
| Other values (76) | 41369 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114964 | |
| Space Separator | 24000 | 16.4% |
| Uppercase Letter | 3515 | 2.4% |
| Other Punctuation | 2721 | 1.9% |
| Decimal Number | 509 | 0.3% |
| Dash Punctuation | 395 | 0.3% |
| Open Punctuation | 13 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
| Currency Symbol | 4 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 13867 | |
| a | 9800 | 8.5% |
| t | 9329 | 8.1% |
| i | 8842 | 7.7% |
| n | 8580 | 7.5% |
| o | 8559 | 7.4% |
| r | 8202 | 7.1% |
| s | 7965 | 6.9% |
| h | 5625 | 4.9% |
| l | 4847 | 4.2% |
| Other values (23) | 29348 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 712 | |
| T | 258 | 7.3% |
| I | 258 | 7.3% |
| W | 228 | 6.5% |
| S | 223 | 6.3% |
| B | 176 | 5.0% |
| M | 167 | 4.8% |
| C | 158 | 4.5% |
| H | 139 | 4.0% |
| R | 119 | 3.4% |
| Other values (17) | 1077 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 117 | |
| 0 | 104 | |
| 9 | 94 | |
| 2 | 43 | 8.4% |
| 6 | 33 | 6.5% |
| 7 | 30 | 5.9% |
| 5 | 26 | 5.1% |
| 8 | 23 | 4.5% |
| 4 | 21 | 4.1% |
| 3 | 18 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1278 | |
| , | 1082 | |
| ' | 260 | 9.6% |
| " | 60 | 2.2% |
| : | 16 | 0.6% |
| ? | 11 | 0.4% |
| / | 8 | 0.3% |
| ; | 6 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 23999 | |
| (non-ASCII space) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 395 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 4 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 2 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 118479 | |
| Common | 27658 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 13867 | |
| a | 9800 | 8.3% |
| t | 9329 | 7.9% |
| i | 8842 | 7.5% |
| n | 8580 | 7.2% |
| o | 8559 | 7.2% |
| r | 8202 | 6.9% |
| s | 7965 | 6.7% |
| h | 5625 | 4.7% |
| l | 4847 | 4.1% |
| Other values (50) | 32863 |
Common
| Value | Count | Frequency (%) |
| (space) | 23999 | |
| . | 1278 | 4.6% |
| , | 1082 | 3.9% |
| - | 395 | 1.4% |
| ' | 260 | 0.9% |
| 1 | 117 | 0.4% |
| 0 | 104 | 0.4% |
| 9 | 94 | 0.3% |
| " | 60 | 0.2% |
| 2 | 43 | 0.2% |
| Other values (16) | 226 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 146116 | |
| None | 21 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 23999 | |
| e | 13867 | 9.5% |
| a | 9800 | 6.7% |
| t | 9329 | 6.4% |
| i | 8842 | 6.1% |
| n | 8580 | 5.9% |
| o | 8559 | 5.9% |
| r | 8202 | 5.6% |
| s | 7965 | 5.5% |
| h | 5625 | 3.8% |
| Other values (65) | 41348 |
None
| Value | Count | Frequency (%) |
| é | 9 | |
| » | 2 | 9.5% |
| è | 2 | 9.5% |
| ü | 1 | 4.8% |
| (non-ASCII space) | 1 | 4.8% |
| ä | 1 | 4.8% |
| ç | 1 | 4.8% |
| « | 1 | 4.8% |
| ö | 1 | 4.8% |
| É | 1 | 4.8% |
scoreAvg
Real number (ℝ)
Missing 
| Distinct | 63 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 157 |
| Missing (%) | 15.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.969121 |
| Minimum | 28 |
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 28 |
|---|---|
| 5-th percentile | 56 |
| Q1 | 70 |
| median | 79 |
| Q3 | 87 |
| 95-th percentile | 96 |
| Maximum | 100 |
| Range | 72 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.383257 |
|---|---|
| Coefficient of variation (CV) | 0.15882258 |
| Kurtosis | 0.41651678 |
| Mean | 77.969121 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.60431623 |
| Sum | 65650 |
| Variance | 153.34506 |
| Monotonicity | Not monotonic |
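For a variable with missing values, the reported mean and distinct percentage are consistent with being computed over the non-missing rows only. This is an inference from the numbers, not a statement in the report. The check below uses only figures from the scoreAvg tables.

```python
# Figures copied from the scoreAvg section.
n_rows = 999
n_missing = 157
total = 65650      # "Sum"
n_distinct = 63

n_present = n_rows - n_missing

# Assumption: missing rows are excluded from both denominators.
mean = total / n_present
distinct_pct = 100 * n_distinct / n_present
missing_pct = 100 * n_missing / n_rows

print(f"{mean:.6f} {distinct_pct:.1f}% {missing_pct:.1f}%")
```

Dividing the sum by all 999 rows would give a mean of about 65.7, well below the reported 77.969121.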
| Value | Count | Frequency (%) |
| 76 | 32 | 3.2% |
| 90 | 29 | 2.9% |
| 84 | 29 | 2.9% |
| 85 | 27 | 2.7% |
| 72 | 27 | 2.7% |
| 86 | 27 | 2.7% |
| 73 | 27 | 2.7% |
| 81 | 26 | 2.6% |
| 77 | 26 | 2.6% |
| 80 | 26 | 2.6% |
| Other values (53) | 566 | |
| (Missing) | 157 | 15.7% |
| Value | Count | Frequency (%) |
| 28 | 1 | 0.1% |
| 30 | 1 | 0.1% |
| 33 | 1 | 0.1% |
| 36 | 1 | 0.1% |
| 40 | 1 | 0.1% |
| 41 | 1 | 0.1% |
| 44 | 1 | 0.1% |
| 45 | 3 | |
| 46 | 1 | 0.1% |
| 47 | 4 |
| Value | Count | Frequency (%) |
| 100 | 12 | |
| 99 | 4 | 0.4% |
| 98 | 9 | |
| 97 | 12 | |
| 96 | 18 | |
| 95 | 11 | |
| 94 | 20 | |
| 93 | 14 | |
| 92 | 13 | |
| 91 | 19 |
Director
Text
| Distinct | 548 |
|---|---|
| Distinct (%) | 54.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.7 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 22 |
| Mean length | 13.485485 |
| Min length | 7 |
Unique
| Unique | 353 |
|---|---|
| Unique (%) | 35.3% |
Sample
| 1st row | Francis Ford Coppola |
|---|---|
| 2nd row | Christopher Nolan |
| 3rd row | Francis Ford Coppola |
| 4th row | Sidney Lumet |
| 5th row | Peter Jackson |
| Value | Count | Frequency (%) |
| john | 34 | 1.6% |
| david | 28 | 1.4% |
| james | 23 | 1.1% |
| robert | 20 | 1.0% |
| martin | 16 | 0.8% |
| richard | 15 | 0.7% |
| lee | 15 | 0.7% |
| george | 14 | 0.7% |
| steven | 14 | 0.7% |
| alfred | 14 | 0.7% |
| Other values (882) | 1879 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1209 | 9.0% |
| a | 1126 | 8.4% |
| (space) | 1073 | 8.0% |
| n | 950 | 7.1% |
| r | 917 | 6.8% |
| o | 851 | 6.3% |
| i | 834 | 6.2% |
| l | 543 | 4.0% |
| s | 497 | 3.7% |
| t | 433 | 3.2% |
| Other values (59) | 5039 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10223 | |
| Uppercase Letter | 2107 | 15.6% |
| Space Separator | 1073 | 8.0% |
| Other Punctuation | 43 | 0.3% |
| Dash Punctuation | 26 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1209 | |
| a | 1126 | |
| n | 950 | 9.3% |
| r | 917 | 9.0% |
| o | 851 | 8.3% |
| i | 834 | 8.2% |
| l | 543 | 5.3% |
| s | 497 | 4.9% |
| t | 433 | 4.2% |
| h | 404 | 4.0% |
| Other values (26) | 2459 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 179 | 8.5% |
| A | 171 | 8.1% |
| M | 166 | 7.9% |
| J | 162 | 7.7% |
| C | 142 | 6.7% |
| R | 131 | 6.2% |
| H | 110 | 5.2% |
| B | 106 | 5.0% |
| T | 102 | 4.8% |
| D | 99 | 4.7% |
| Other values (19) | 739 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 41 | |
| ' | 2 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1073 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 26 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12330 | |
| Common | 1142 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1209 | 9.8% |
| a | 1126 | 9.1% |
| n | 950 | 7.7% |
| r | 917 | 7.4% |
| o | 851 | 6.9% |
| i | 834 | 6.8% |
| l | 543 | 4.4% |
| s | 497 | 4.0% |
| t | 433 | 3.5% |
| h | 404 | 3.3% |
| Other values (55) | 4566 |
Common
| Value | Count | Frequency (%) |
| (space) | 1073 | |
| . | 41 | 3.6% |
| - | 26 | 2.3% |
| ' | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13421 | |
| None | 51 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1209 | 9.0% |
| a | 1126 | 8.4% |
| (space) | 1073 | 8.0% |
| n | 950 | 7.1% |
| r | 917 | 6.8% |
| o | 851 | 6.3% |
| i | 834 | 6.2% |
| l | 543 | 4.0% |
| s | 497 | 3.7% |
| t | 433 | 3.2% |
| Other values (46) | 4988 |
None
| Value | Count | Frequency (%) |
| ó | 10 | |
| á | 9 | |
| é | 8 | |
| ñ | 7 | |
| ô | 5 | |
| ö | 3 | 5.9% |
| ç | 2 | 3.9% |
| Ö | 2 | 3.9% |
| Ô | 1 | 2.0% |
| Ç | 1 | 2.0% |
| Other values (3) | 3 | 5.9% |
Star1
Text
| Distinct | 659 |
|---|---|
| Distinct (%) | 66.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.3 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 13.005005 |
| Min length | 4 |
Unique
| Unique | 502 |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Marlon Brando |
|---|---|
| 2nd row | Christian Bale |
| 3rd row | Al Pacino |
| 4th row | Henry Fonda |
| 5th row | Elijah Wood |
| Value | Count | Frequency (%) |
| tom | 22 | 1.1% |
| daniel | 17 | 0.8% |
| robert | 17 | 0.8% |
| john | 16 | 0.8% |
| khan | 16 | 0.8% |
| james | 15 | 0.7% |
| michael | 12 | 0.6% |
| hanks | 12 | 0.6% |
| ethan | 11 | 0.5% |
| de | 11 | 0.5% |
| Other values (1112) | 1898 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1239 | 9.5% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.3% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.5% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (62) | 4808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9784 | |
| Uppercase Letter | 2099 | 16.2% |
| Space Separator | 1048 | 8.1% |
| Dash Punctuation | 32 | 0.2% |
| Other Punctuation | 29 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1239 | |
| e | 1088 | |
| n | 951 | |
| r | 816 | 8.3% |
| i | 794 | 8.1% |
| o | 767 | 7.8% |
| l | 590 | 6.0% |
| t | 453 | 4.6% |
| s | 438 | 4.5% |
| h | 424 | 4.3% |
| Other values (29) | 2224 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 187 | 8.9% |
| M | 172 | 8.2% |
| J | 144 | 6.9% |
| D | 142 | 6.8% |
| B | 142 | 6.8% |
| S | 141 | 6.7% |
| R | 126 | 6.0% |
| A | 115 | 5.5% |
| H | 106 | 5.1% |
| T | 104 | 5.0% |
| Other values (19) | 720 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19 | |
| ' | 10 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1048 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11883 | |
| Common | 1109 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1239 | 10.4% |
| e | 1088 | 9.2% |
| n | 951 | 8.0% |
| r | 816 | 6.9% |
| i | 794 | 6.7% |
| o | 767 | 6.5% |
| l | 590 | 5.0% |
| t | 453 | 3.8% |
| s | 438 | 3.7% |
| h | 424 | 3.6% |
| Other values (58) | 4323 |
Common
| Value | Count | Frequency (%) |
| (space) | 1048 | |
| - | 32 | 2.9% |
| . | 19 | 1.7% |
| ' | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12937 | |
| None | 55 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1239 | 9.6% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.4% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.6% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (45) | 4753 |
None
| Value | Count | Frequency (%) |
| ô | 13 | |
| é | 7 | |
| ü | 6 | |
| í | 6 | |
| û | 4 | 7.3% |
| ö | 4 | 7.3% |
| è | 3 | 5.5% |
| å | 2 | 3.6% |
| ë | 2 | 3.6% |
| Ç | 1 | 1.8% |
| Other values (7) | 7 |
Star2
Text
| Distinct | 659 |
|---|---|
| Distinct (%) | 66.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.3 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 13.005005 |
| Min length | 4 |
Unique
| Unique | 502 |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Marlon Brando |
|---|---|
| 2nd row | Christian Bale |
| 3rd row | Al Pacino |
| 4th row | Henry Fonda |
| 5th row | Elijah Wood |
| Value | Count | Frequency (%) |
| tom | 22 | 1.1% |
| daniel | 17 | 0.8% |
| robert | 17 | 0.8% |
| john | 16 | 0.8% |
| khan | 16 | 0.8% |
| james | 15 | 0.7% |
| michael | 12 | 0.6% |
| hanks | 12 | 0.6% |
| ethan | 11 | 0.5% |
| de | 11 | 0.5% |
| Other values (1112) | 1898 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1239 | 9.5% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.3% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.5% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (62) | 4808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9784 | |
| Uppercase Letter | 2099 | 16.2% |
| Space Separator | 1048 | 8.1% |
| Dash Punctuation | 32 | 0.2% |
| Other Punctuation | 29 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1239 | |
| e | 1088 | |
| n | 951 | |
| r | 816 | 8.3% |
| i | 794 | 8.1% |
| o | 767 | 7.8% |
| l | 590 | 6.0% |
| t | 453 | 4.6% |
| s | 438 | 4.5% |
| h | 424 | 4.3% |
| Other values (29) | 2224 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 187 | 8.9% |
| M | 172 | 8.2% |
| J | 144 | 6.9% |
| D | 142 | 6.8% |
| B | 142 | 6.8% |
| S | 141 | 6.7% |
| R | 126 | 6.0% |
| A | 115 | 5.5% |
| H | 106 | 5.1% |
| T | 104 | 5.0% |
| Other values (19) | 720 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19 | |
| ' | 10 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1048 | |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11883 | |
| Common | 1109 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1239 | 10.4% |
| e | 1088 | 9.2% |
| n | 951 | 8.0% |
| r | 816 | 6.9% |
| i | 794 | 6.7% |
| o | 767 | 6.5% |
| l | 590 | 5.0% |
| t | 453 | 3.8% |
| s | 438 | 3.7% |
| h | 424 | 3.6% |
| Other values (58) | 4323 |
Common
| Value | Count | Frequency (%) |
| (space) | 1048 | |
| - | 32 | 2.9% |
| . | 19 | 1.7% |
| ' | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12937 | |
| None | 55 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1239 | 9.6% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.4% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.6% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (45) | 4753 |
None
| Value | Count | Frequency (%) |
| ô | 13 | |
| é | 7 | |
| ü | 6 | |
| í | 6 | |
| û | 4 | 7.3% |
| ö | 4 | 7.3% |
| è | 3 | 5.5% |
| å | 2 | 3.6% |
| ë | 2 | 3.6% |
| Ç | 1 | 1.8% |
| Other values (7) | 7 |
Star3
Text
| Distinct | 659 |
|---|---|
| Distinct (%) | 66.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.3 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 13.005005 |
| Min length | 4 |
Unique
| Unique | 502 |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Marlon Brando |
|---|---|
| 2nd row | Christian Bale |
| 3rd row | Al Pacino |
| 4th row | Henry Fonda |
| 5th row | Elijah Wood |
| Value | Count | Frequency (%) |
| tom | 22 | 1.1% |
| daniel | 17 | 0.8% |
| robert | 17 | 0.8% |
| john | 16 | 0.8% |
| khan | 16 | 0.8% |
| james | 15 | 0.7% |
| michael | 12 | 0.6% |
| hanks | 12 | 0.6% |
| ethan | 11 | 0.5% |
| de | 11 | 0.5% |
| Other values (1112) | 1898 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1239 | 9.5% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.3% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.5% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (62) | 4808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9784 | |
| Uppercase Letter | 2099 | 16.2% |
| Space Separator | 1048 | 8.1% |
| Dash Punctuation | 32 | 0.2% |
| Other Punctuation | 29 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1239 | |
| e | 1088 | |
| n | 951 | |
| r | 816 | 8.3% |
| i | 794 | 8.1% |
| o | 767 | 7.8% |
| l | 590 | 6.0% |
| t | 453 | 4.6% |
| s | 438 | 4.5% |
| h | 424 | 4.3% |
| Other values (29) | 2224 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 187 | 8.9% |
| M | 172 | 8.2% |
| J | 144 | 6.9% |
| D | 142 | 6.8% |
| B | 142 | 6.8% |
| S | 141 | 6.7% |
| R | 126 | 6.0% |
| A | 115 | 5.5% |
| H | 106 | 5.1% |
| T | 104 | 5.0% |
| Other values (19) | 720 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19 | |
| ' | 10 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1048 | |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11883 | |
| Common | 1109 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1239 | 10.4% |
| e | 1088 | 9.2% |
| n | 951 | 8.0% |
| r | 816 | 6.9% |
| i | 794 | 6.7% |
| o | 767 | 6.5% |
| l | 590 | 5.0% |
| t | 453 | 3.8% |
| s | 438 | 3.7% |
| h | 424 | 3.6% |
| Other values (58) | 4323 |
Common
| Value | Count | Frequency (%) |
| (space) | 1048 | |
| - | 32 | 2.9% |
| . | 19 | 1.7% |
| ' | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12937 | |
| None | 55 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1239 | 9.6% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.4% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.6% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (45) | 4753 |
None
| Value | Count | Frequency (%) |
| ô | 13 | |
| é | 7 | |
| ü | 6 | |
| í | 6 | |
| û | 4 | 7.3% |
| ö | 4 | 7.3% |
| è | 3 | 5.5% |
| å | 2 | 3.6% |
| ë | 2 | 3.6% |
| Ç | 1 | 1.8% |
| Other values (7) | 7 |
Star4
Text
| Distinct | 659 |
|---|---|
| Distinct (%) | 66.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 70.3 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 21 |
| Mean length | 13.005005 |
| Min length | 4 |
Unique
| Unique | 502 |
|---|---|
| Unique (%) | 50.3% |
Sample
| 1st row | Marlon Brando |
|---|---|
| 2nd row | Christian Bale |
| 3rd row | Al Pacino |
| 4th row | Henry Fonda |
| 5th row | Elijah Wood |
| Value | Count | Frequency (%) |
| tom | 22 | 1.1% |
| daniel | 17 | 0.8% |
| robert | 17 | 0.8% |
| john | 16 | 0.8% |
| khan | 16 | 0.8% |
| james | 15 | 0.7% |
| michael | 12 | 0.6% |
| hanks | 12 | 0.6% |
| ethan | 11 | 0.5% |
| de | 11 | 0.5% |
| Other values (1112) | 1898 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1239 | 9.5% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.3% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.5% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (62) | 4808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9784 | |
| Uppercase Letter | 2099 | 16.2% |
| Space Separator | 1048 | 8.1% |
| Dash Punctuation | 32 | 0.2% |
| Other Punctuation | 29 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1239 | |
| e | 1088 | |
| n | 951 | |
| r | 816 | 8.3% |
| i | 794 | 8.1% |
| o | 767 | 7.8% |
| l | 590 | 6.0% |
| t | 453 | 4.6% |
| s | 438 | 4.5% |
| h | 424 | 4.3% |
| Other values (29) | 2224 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 187 | 8.9% |
| M | 172 | 8.2% |
| J | 144 | 6.9% |
| D | 142 | 6.8% |
| B | 142 | 6.8% |
| S | 141 | 6.7% |
| R | 126 | 6.0% |
| A | 115 | 5.5% |
| H | 106 | 5.1% |
| T | 104 | 5.0% |
| Other values (19) | 720 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 19 | |
| ' | 10 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1048 | |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 32 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11883 | |
| Common | 1109 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1239 | 10.4% |
| e | 1088 | 9.2% |
| n | 951 | 8.0% |
| r | 816 | 6.9% |
| i | 794 | 6.7% |
| o | 767 | 6.5% |
| l | 590 | 5.0% |
| t | 453 | 3.8% |
| s | 438 | 3.7% |
| h | 424 | 3.6% |
| Other values (58) | 4323 |
Common
| Value | Count | Frequency (%) |
| (space) | 1048 | |
| - | 32 | 2.9% |
| . | 19 | 1.7% |
| ' | 10 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12937 | |
| None | 55 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1239 | 9.6% |
| e | 1088 | 8.4% |
| (space) | 1048 | 8.1% |
| n | 951 | 7.4% |
| r | 816 | 6.3% |
| i | 794 | 6.1% |
| o | 767 | 5.9% |
| l | 590 | 4.6% |
| t | 453 | 3.5% |
| s | 438 | 3.4% |
| Other values (45) | 4753 |
None
| Value | Count | Frequency (%) |
| ô | 13 | |
| é | 7 | |
| ü | 6 | |
| í | 6 | |
| û | 4 | 7.3% |
| ö | 4 | 7.3% |
| è | 3 | 5.5% |
| å | 2 | 3.6% |
| ë | 2 | 3.6% |
| Ç | 1 | 1.8% |
| Other values (7) | 7 |
Votes
Real number (ℝ)
High correlation 
| Distinct | 998 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 271621.42 |
| Minimum | 25088 |
| Maximum | 2303232 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 25088 |
|---|---|
| 5-th percentile | 29680 |
| Q1 | 55471.5 |
| median | 138356 |
| Q3 | 373167.5 |
| 95-th percentile | 939289.9 |
| Maximum | 2303232 |
| Range | 2278144 |
| Interquartile range (IQR) | 317696 |
Descriptive statistics
| Standard deviation | 320912.62 |
|---|---|
| Coefficient of variation (CV) | 1.1814702 |
| Kurtosis | 6.041324 |
| Mean | 271621.42 |
| Median Absolute Deviation (MAD) | 98475 |
| Skewness | 2.1943511 |
| Sum | 2.713498 × 10⁸ |
| Variance | 1.0298491 × 10¹¹ |
| Monotonicity | Not monotonic |
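The dispersion figures reported for Votes are tied together by simple arithmetic. As a minimal sketch, assuming only the rounded values printed in the tables above (the variable names are illustrative and not part of the report), the range, interquartile range, and coefficient of variation can be recomputed as follows:

```python
# Values copied from the Votes tables above, rounded as printed in the report.
minimum, maximum = 25088, 2303232
q1, q3 = 55471.5, 373167.5
mean, std = 271621.42, 320912.62

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # IQR = Q3 - Q1
cv = std / mean                  # CV = standard deviation / mean

print(value_range, iqr, round(cv, 4))
```

The results agree with the reported Range (2278144), IQR (317696), and CV (about 1.1815). A CV above 1 together with a mean well above the median is consistent with the strong right skew the report shows for this variable.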
| Value | Count | Frequency (%) |
| 65341 | 2 | 0.2% |
| 171640 | 1 | 0.1% |
| 699256 | 1 | 0.1% |
| 32802 | 1 | 0.1% |
| 93878 | 1 | 0.1% |
| 1213505 | 1 | 0.1% |
| 51853 | 1 | 0.1% |
| 1642758 | 1 | 0.1% |
| 2067042 | 1 | 0.1% |
| 1854740 | 1 | 0.1% |
| Other values (988) | 988 |
| Value | Count | Frequency (%) |
| 25088 | 1 | |
| 25198 | 1 | |
| 25229 | 1 | |
| 25312 | 1 | |
| 25344 | 1 | |
| 25938 | 1 | |
| 26337 | 1 | |
| 26402 | 1 | |
| 26429 | 1 | |
| 26457 | 1 |
| Value | Count | Frequency (%) |
| 2303232 | 1 | |
| 2067042 | 1 | |
| 1854740 | 1 | |
| 1826188 | 1 | |
| 1809221 | 1 | |
| 1676426 | 1 | |
| 1661481 | 1 | |
| 1642758 | 1 | |
| 1620367 | 1 | |
| 1516346 | 1 |
Revenue
Real number (ℝ)
High correlation  Missing 
| Distinct | 822 |
|---|---|
| Distinct (%) | 99.0% |
| Missing | 169 |
| Missing (%) | 16.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 68082574 |
| Minimum | 1305 |
| Maximum | 9.3666222 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1305 |
|---|---|
| 5-th percentile | 139783.9 |
| Q1 | 3245338.5 |
| median | 23457440 |
| Q3 | 80876340 |
| 95-th percentile | 2.9163069 × 10⁸ |
| Maximum | 9.3666222 × 10⁸ |
| Range | 9.3666092 × 10⁸ |
| Interquartile range (IQR) | 77631002 |
Descriptive statistics
| Standard deviation | 1.0980755 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 1.6128584 |
| Kurtosis | 13.894054 |
| Mean | 68082574 |
| Median Absolute Deviation (MAD) | 22698854 |
| Skewness | 3.1277452 |
| Sum | 5.6508537 × 10¹⁰ |
| Variance | 1.2057699 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4360000 | 5 | 0.5% |
| 5321508 | 2 | 0.2% |
| 5450000 | 2 | 0.2% |
| 9600000 | 2 | 0.2% |
| 25000000 | 2 | 0.2% |
| 216540909 | 1 | 0.1% |
| 49530280 | 1 | 0.1% |
| 78756177 | 1 | 0.1% |
| 292576195 | 1 | 0.1% |
| 30500000 | 1 | 0.1% |
| Other values (812) | 812 | |
| (Missing) | 169 | 16.9% |
| Value | Count | Frequency (%) |
| 1305 | 1 | |
| 3296 | 1 | |
| 3600 | 1 | |
| 6013 | 1 | |
| 6460 | 1 | |
| 7461 | 1 | |
| 8060 | 1 | |
| 10177 | 1 | |
| 10950 | 1 | |
| 12562 | 1 |
| Value | Count | Frequency (%) |
| 936662225 | 1 | |
| 858373000 | 1 | |
| 760507625 | 1 | |
| 678815482 | 1 | |
| 659325379 | 1 | |
| 623279547 | 1 | |
| 608581744 | 1 | |
| 534858444 | 1 | |
| 532177324 | 1 | |
| 448139099 | 1 |
tmdb_budget
Real number (ℝ)
High correlation  Zeros 
| Distinct | 284 |
|---|---|
| Distinct (%) | 28.6% |
| Missing | 6 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25839015 |
| Minimum | 0 |
| Maximum | 3.56 × 10⁸ |
| Zeros | 150 |
| Zeros (%) | 15.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1255000 |
| median | 6900000 |
| Q3 | 25000000 |
| 95-th percentile | 1.5 × 10⁸ |
| Maximum | 3.56 × 10⁸ |
| Range | 3.56 × 10⁸ |
| Interquartile range (IQR) | 23745000 |
Descriptive statistics
| Standard deviation | 47120610 |
|---|---|
| Coefficient of variation (CV) | 1.8236225 |
| Kurtosis | 10.178 |
| Mean | 25839015 |
| Median Absolute Deviation (MAD) | 6900000 |
| Skewness | 3.0205037 |
| Sum | 2.5658142 × 10¹⁰ |
| Variance | 2.2203519 × 10¹⁵ |
| Monotonicity | Not monotonic |
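The tmdb_budget mean is taken over the non-missing rows, and the 150 zeros count as ordinary values. The sketch below checks this against the reported sum, using only figures printed in this report (999 observations, 6 missing, 150 zeros). The last step, a mean that ignores zeros, rests on the assumption that zeros stand for unreported budgets, which the report does not confirm.

```python
# Figures copied from the report; names are illustrative only.
n_rows = 999                  # number of observations in the dataset
n_missing = 6                 # missing tmdb_budget values
n_zeros = 150                 # tmdb_budget values equal to 0
mean = 25839015               # reported mean
reported_sum = 2.5658142e10   # reported sum

# The mean is computed over non-missing rows, zeros included.
n_valid = n_rows - n_missing
implied_sum = mean * n_valid
rel_diff = abs(implied_sum - reported_sum) / reported_sum

# Hypothetical: mean over non-zero rows, assuming zeros are placeholders.
mean_nonzero = reported_sum / (n_valid - n_zeros)

print(n_valid, f"{implied_sum:.4e}", f"{rel_diff:.1e}", round(mean_nonzero))
```

The implied and reported sums agree to within rounding, so the zeros are included in the reported mean. If those zeros really are placeholders, the mean over non-zero budgets is noticeably higher than the reported one.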
| Value | Count | Frequency (%) |
| 0 | 150 | 15.0% |
| 15000000 | 30 | 3.0% |
| 4000000 | 27 | 2.7% |
| 25000000 | 21 | 2.1% |
| 3000000 | 20 | 2.0% |
| 12000000 | 19 | 1.9% |
| 6000000 | 18 | 1.8% |
| 2000000 | 18 | 1.8% |
| 40000000 | 17 | 1.7% |
| 30000000 | 17 | 1.7% |
| Other values (274) | 656 |
| Value | Count | Frequency (%) |
| 0 | 150 | |
| 105 | 1 | 0.1% |
| 3025 | 1 | 0.1% |
| 18000 | 1 | 0.1% |
| 27575 | 1 | 0.1% |
| 64000 | 1 | 0.1% |
| 114000 | 1 | 0.1% |
| 120000 | 1 | 0.1% |
| 133000 | 1 | 0.1% |
| 150000 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 356000000 | 1 | 0.1% |
| 300000000 | 1 | 0.1% |
| 260000000 | 1 | 0.1% |
| 250000000 | 7 | |
| 245000000 | 1 | 0.1% |
| 237000000 | 1 | 0.1% |
| 220000000 | 1 | 0.1% |
| 200000000 | 6 | |
| 190000000 | 1 | 0.1% |
| 185000000 | 1 | 0.1% |
tmdb_popularity
Real number (ℝ)
High correlation 
| Distinct | 983 |
|---|---|
| Distinct (%) | 99.0% |
| Missing | 6 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.223619 |
| Minimum | 0.0096 |
| Maximum | 48.7572 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0.0096 |
|---|---|
| 5-th percentile | 1.45766 |
| Q1 | 2.6588 |
| median | 4.3308 |
| Q3 | 7.944 |
| 95-th percentile | 17.9107 |
| Maximum | 48.7572 |
| Range | 48.7476 |
| Interquartile range (IQR) | 5.2852 |
Descriptive statistics
| Standard deviation | 5.4444054 |
|---|---|
| Coefficient of variation (CV) | 0.87479734 |
| Kurtosis | 8.0700012 |
| Mean | 6.223619 |
| Median Absolute Deviation (MAD) | 2.1649 |
| Skewness | 2.2926687 |
| Sum | 6180.0537 |
| Variance | 29.64155 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.2227 | 2 | 0.2% |
| 1.6467 | 2 | 0.2% |
| 6.6648 | 2 | 0.2% |
| 1.6796 | 2 | 0.2% |
| 2.581 | 2 | 0.2% |
| 2.7535 | 2 | 0.2% |
| 3.0026 | 2 | 0.2% |
| 5.1589 | 2 | 0.2% |
| 4.7199 | 2 | 0.2% |
| 2.6488 | 2 | 0.2% |
| Other values (973) | 973 | |
| (Missing) | 6 | 0.6% |
| Value | Count | Frequency (%) |
| 0.0096 | 1 | |
| 0.0336 | 1 | |
| 0.0486 | 1 | |
| 0.1994 | 1 | |
| 0.5302 | 1 | |
| 0.5579 | 1 | |
| 0.619 | 1 | |
| 0.7419 | 1 | |
| 0.7471 | 1 | |
| 0.7693 | 1 |
| Value | Count | Frequency (%) |
| 48.7572 | 1 | |
| 39.8771 | 1 | |
| 37.3574 | 1 | |
| 32.4134 | 1 | |
| 32.1482 | 1 | |
| 27.8211 | 1 | |
| 26.7541 | 1 | |
| 24.7742 | 1 | |
| 24.7217 | 1 | |
| 24.6837 | 1 |
tmdb_vote_count
Real number (ℝ)
High correlation 
| Distinct | 929 |
|---|---|
| Distinct (%) | 93.6% |
| Missing | 6 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5496.4179 |
| Minimum | 0 |
| Maximum | 37863 |
| Zeros | 3 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 339.2 |
| Q1 | 1033 |
| median | 2582 |
| Q3 | 7816 |
| 95-th percentile | 19384.2 |
| Maximum | 37863 |
| Range | 37863 |
| Interquartile range (IQR) | 6783 |
Descriptive statistics
| Standard deviation | 6537.6424 |
|---|---|
| Coefficient of variation (CV) | 1.1894369 |
| Kurtosis | 3.4133625 |
| Mean | 5496.4179 |
| Median Absolute Deviation (MAD) | 1897 |
| Skewness | 1.8374153 |
| Sum | 5457943 |
| Variance | 42740769 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3 | 0.3% |
| 286 | 3 | 0.3% |
| 815 | 3 | 0.3% |
| 4450 | 3 | 0.3% |
| 1167 | 3 | 0.3% |
| 1873 | 3 | 0.3% |
| 334 | 2 | 0.2% |
| 1981 | 2 | 0.2% |
| 2774 | 2 | 0.2% |
| 1499 | 2 | 0.2% |
| Other values (919) | 967 | |
| (Missing) | 6 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 1 | 1 | 0.1% |
| 4 | 1 | 0.1% |
| 11 | 1 | 0.1% |
| 33 | 1 | 0.1% |
| 100 | 1 | 0.1% |
| 106 | 1 | 0.1% |
| 128 | 1 | 0.1% |
| 133 | 1 | 0.1% |
| 135 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 37863 | 1 | |
| 37745 | 1 | |
| 34305 | 1 | |
| 32846 | 1 | |
| 32552 | 1 | |
| 31864 | 1 | |
| 30897 | 1 | |
| 30680 | 1 | |
| 29017 | 1 | |
| 28819 | 1 |
tmdb_revenue
Real number (ℝ)
High correlation  Zeros 
| Distinct | 832 |
|---|---|
| Distinct (%) | 83.8% |
| Missing | 6 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3668567 × 10⁸ |
| Minimum | 0 |
| Maximum | 2.923706 × 10⁹ |
| Zeros | 112 |
| Zeros (%) | 11.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4396821 |
| median | 31500000 |
| Q3 | 1.249 × 10⁸ |
| 95-th percentile | 6.7463509 × 10⁸ |
| Maximum | 2.923706 × 10⁹ |
| Range | 2.923706 × 10⁹ |
| Interquartile range (IQR) | 1.2050318 × 10⁸ |
Descriptive statistics
| Standard deviation | 2.7556795 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 2.0160705 |
| Kurtosis | 29.232896 |
| Mean | 1.3668567 × 10⁸ |
| Median Absolute Deviation (MAD) | 31481879 |
| Skewness | 4.4408458 |
| Sum | 1.3572887 × 10¹¹ |
| Variance | 7.5937692 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 112 | 11.2% |
| 10000000 | 5 | 0.5% |
| 8000000 | 5 | 0.5% |
| 11000000 | 4 | 0.4% |
| 23300000 | 3 | 0.3% |
| 12000000 | 3 | 0.3% |
| 25000000 | 3 | 0.3% |
| 4000000 | 3 | 0.3% |
| 30000000 | 3 | 0.3% |
| 4500000 | 3 | 0.3% |
| Other values (822) | 849 | |
| (Missing) | 6 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 112 | |
| 3193 | 1 | 0.1% |
| 8811 | 1 | 0.1% |
| 12678 | 1 | 0.1% |
| 13422 | 1 | 0.1% |
| 18121 | 1 | 0.1% |
| 24173 | 1 | 0.1% |
| 24517 | 1 | 0.1% |
| 27105 | 1 | 0.1% |
| 35274 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 2923706026 | 1 | |
| 2799439100 | 1 | |
| 2264162353 | 1 | |
| 2068223624 | 1 | |
| 2052415039 | 1 | |
| 1518815515 | 1 | |
| 1341511219 | 1 | |
| 1243225667 | 1 | |
| 1155046416 | 1 | |
| 1118888979 | 1 |
Interactions
Correlations
| | Certificate | Rating | Revenue | Runtime | Unnamed: 0 | Unnamed: 0.1 | Votes | Year | scoreAvg | tmdb_budget | tmdb_popularity | tmdb_revenue | tmdb_vote_count |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Certificate | 1.000 | 0.000 | 0.063 | 0.141 | 0.072 | 0.072 | 0.057 | 0.302 | 0.088 | 0.006 | 0.061 | 0.049 | 0.091 |
| Rating | 0.000 | 1.000 | -0.050 | 0.210 | -0.992 | -0.992 | 0.212 | -0.127 | 0.285 | -0.105 | 0.152 | -0.029 | 0.113 |
| Revenue | 0.063 | -0.050 | 1.000 | 0.178 | 0.036 | 0.036 | 0.700 | 0.175 | -0.100 | 0.764 | 0.681 | 0.898 | 0.720 |
| Runtime | 0.141 | 0.210 | 0.178 | 1.000 | -0.233 | -0.233 | 0.157 | 0.194 | -0.090 | 0.309 | 0.172 | 0.259 | 0.066 |
| Unnamed: 0 | 0.072 | -0.992 | 0.036 | -0.233 | 1.000 | 1.000 | -0.245 | 0.012 | -0.259 | 0.061 | -0.177 | -0.012 | -0.150 |
| Unnamed: 0.1 | 0.072 | -0.992 | 0.036 | -0.233 | 1.000 | 1.000 | -0.245 | 0.012 | -0.259 | 0.061 | -0.177 | -0.012 | -0.150 |
| Votes | 0.057 | 0.212 | 0.700 | 0.157 | -0.245 | -0.245 | 1.000 | 0.255 | -0.073 | 0.675 | 0.790 | 0.739 | 0.925 |
| Year | 0.302 | -0.127 | 0.175 | 0.194 | 0.012 | 0.012 | 0.255 | 1.000 | -0.264 | 0.416 | 0.224 | 0.376 | 0.298 |
| scoreAvg | 0.088 | 0.285 | -0.100 | -0.090 | -0.259 | -0.259 | -0.073 | -0.264 | 1.000 | -0.259 | -0.108 | -0.183 | -0.106 |
| tmdb_budget | 0.006 | -0.105 | 0.764 | 0.309 | 0.061 | 0.061 | 0.675 | 0.416 | -0.259 | 1.000 | 0.656 | 0.807 | 0.687 |
| tmdb_popularity | 0.061 | 0.152 | 0.681 | 0.172 | -0.177 | -0.177 | 0.790 | 0.224 | -0.108 | 0.656 | 1.000 | 0.709 | 0.847 |
| tmdb_revenue | 0.049 | -0.029 | 0.898 | 0.259 | -0.012 | -0.012 | 0.739 | 0.376 | -0.183 | 0.807 | 0.709 | 1.000 | 0.754 |
| tmdb_vote_count | 0.091 | 0.113 | 0.720 | 0.066 | -0.150 | -0.150 | 0.925 | 0.298 | -0.106 | 0.687 | 0.847 | 0.754 | 1.000 |
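The "High correlation" alerts in the overview can be read straight off this matrix. The sketch below copies the Revenue and Rating rows and lists the fields whose absolute coefficient meets a cutoff. The cutoff of 0.5 and the helper name `strongly_correlated` are assumptions made for illustration; the report does not state which threshold it uses.

```python
# Two rows copied from the correlation matrix (diagonal entries omitted).
correlations = {
    "Revenue": {
        "Certificate": 0.063, "Rating": -0.050, "Runtime": 0.178,
        "Unnamed: 0": 0.036, "Unnamed: 0.1": 0.036, "Votes": 0.700,
        "Year": 0.175, "scoreAvg": -0.100, "tmdb_budget": 0.764,
        "tmdb_popularity": 0.681, "tmdb_revenue": 0.898,
        "tmdb_vote_count": 0.720,
    },
    "Rating": {
        "Certificate": 0.000, "Revenue": -0.050, "Runtime": 0.210,
        "Unnamed: 0": -0.992, "Unnamed: 0.1": -0.992, "Votes": 0.212,
        "Year": -0.127, "scoreAvg": 0.285, "tmdb_budget": -0.105,
        "tmdb_popularity": 0.152, "tmdb_revenue": -0.029,
        "tmdb_vote_count": 0.113,
    },
}

THRESHOLD = 0.5  # assumed cutoff, not taken from the report


def strongly_correlated(row, threshold=THRESHOLD):
    """Return the field names whose |coefficient| is at least the threshold."""
    return sorted(name for name, r in row.items() if abs(r) >= threshold)


for variable, row in correlations.items():
    print(variable, "->", strongly_correlated(row))
```

Under this assumed cutoff, Revenue is flagged against five fields and Rating against the two index columns, which matches the counts in the overview alerts. The near-perfect negative coefficient between Rating and the `Unnamed` columns most likely reflects that the rows are ordered by rating, as the sample rows suggest.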
Missing values
Sample
First rows
| | Unnamed: 0.1 | Unnamed: 0 | Title | Year | Certificate | Runtime | Genre | Rating | Overview | scoreAvg | Director | Star1 | Star2 | Star3 | Star4 | Votes | Revenue | tmdb_budget | tmdb_popularity | tmdb_vote_count | tmdb_revenue |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 1 | The Godfather | 1972.0 | A | 175.0 | Crime, Drama | 9.2 | An organized crime dynasty's aging patriarch transfers control of his clandestine empire to his reluctant son. | 100.0 | Francis Ford Coppola | Marlon Brando | Marlon Brando | Marlon Brando | Marlon Brando | 1620367 | 134966411.0 | 6000000.0 | 24.7742 | 21774.0 | 2.450664e+08 |
| 1 | 1 | 2 | The Dark Knight | 2008.0 | UA | 152.0 | Action, Crime, Drama | 9.0 | When the menace known as the Joker wreaks havoc and chaos on the people of Gotham, Batman must accept one of the greatest psychological and physical tests of his ability to fight injustice. | 84.0 | Christopher Nolan | Christian Bale | Christian Bale | Christian Bale | Christian Bale | 2303232 | 534858444.0 | 185000000.0 | 27.8211 | 34305.0 | 1.004558e+09 |
| 2 | 2 | 3 | The Godfather: Part II | 1974.0 | A | 202.0 | Crime, Drama | 9.0 | The early life and career of Vito Corleone in 1920s New York City is portrayed, while his son, Michael, expands and tightens his grip on the family crime syndicate. | 90.0 | Francis Ford Coppola | Al Pacino | Al Pacino | Al Pacino | Al Pacino | 1129952 | 57300000.0 | 13000000.0 | 16.7095 | 13147.0 | 1.026000e+08 |
| 3 | 3 | 4 | 12 Angry Men | 1957.0 | U | 96.0 | Crime, Drama | 9.0 | A jury holdout attempts to prevent a miscarriage of justice by forcing his colleagues to reconsider the evidence. | 96.0 | Sidney Lumet | Henry Fonda | Henry Fonda | Henry Fonda | Henry Fonda | 689845 | 4360000.0 | 397751.0 | 13.4321 | 9358.0 | 4.360000e+06 |
| 4 | 4 | 5 | The Lord of the Rings: The Return of the King | 2003.0 | U | 201.0 | Action, Adventure, Drama | 8.9 | Gandalf and Aragorn lead the World of Men against Sauron's army to draw his gaze from Frodo and Sam as they approach Mount Doom with the One Ring. | 94.0 | Peter Jackson | Elijah Wood | Elijah Wood | Elijah Wood | Elijah Wood | 1642758 | 377845905.0 | 94000000.0 | 20.7345 | 25403.0 | 1.118889e+09 |
| 5 | 5 | 6 | Pulp Fiction | 1994.0 | A | 154.0 | Crime, Drama | 8.9 | The lives of two mob hitmen, a boxer, a gangster and his wife, and a pair of diner bandits intertwine in four tales of violence and redemption. | 94.0 | Quentin Tarantino | John Travolta | John Travolta | John Travolta | John Travolta | 1826188 | 107928762.0 | 8000000.0 | 17.2460 | 29017.0 | 2.139288e+08 |
| 6 | 6 | 7 | Schindler's List | 1993.0 | A | 195.0 | Biography, Drama, History | 8.9 | In German-occupied Poland during World War II, industrialist Oskar Schindler gradually becomes concerned for his Jewish workforce after witnessing their persecution by the Nazis. | 94.0 | Steven Spielberg | Liam Neeson | Liam Neeson | Liam Neeson | Liam Neeson | 1213505 | 96898818.0 | 22000000.0 | 17.4025 | 16671.0 | 3.213656e+08 |
| 7 | 7 | 8 | Inception | 2010.0 | UA | 148.0 | Action, Adventure, Sci-Fi | 8.8 | A thief who steals corporate secrets through the use of dream-sharing technology is given the inverse task of planting an idea into the mind of a C.E.O. | 74.0 | Christopher Nolan | Leonardo DiCaprio | Leonardo DiCaprio | Leonardo DiCaprio | Leonardo DiCaprio | 2067042 | 292576195.0 | 160000000.0 | 26.7541 | 37863.0 | 8.390306e+08 |
| 8 | 8 | 9 | Fight Club | 1999.0 | A | 139.0 | Drama | 8.8 | An insomniac office worker and a devil-may-care soapmaker form an underground fight club that evolves into something much, much more. | 66.0 | David Fincher | Brad Pitt | Brad Pitt | Brad Pitt | Brad Pitt | 1854740 | 37030102.0 | 63000000.0 | 24.5456 | 30680.0 | 1.008538e+08 |
| 9 | 9 | 10 | The Lord of the Rings: The Fellowship of the Ring | 2001.0 | U | 178.0 | Action, Adventure, Drama | 8.8 | A meek Hobbit from the Shire and eight companions set out on a journey to destroy the powerful One Ring and save Middle-earth from the Dark Lord Sauron. | 92.0 | Peter Jackson | Elijah Wood | Elijah Wood | Elijah Wood | Elijah Wood | 1661481 | 315544750.0 | 93000000.0 | 23.9810 | 26322.0 | 8.713684e+08 |
Last rows
| | Unnamed: 0.1 | Unnamed: 0 | Title | Year | Certificate | Runtime | Genre | Rating | Overview | scoreAvg | Director | Star1 | Star2 | Star3 | Star4 | Votes | Revenue | tmdb_budget | tmdb_popularity | tmdb_vote_count | tmdb_revenue |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 989 | 989 | 990 | Giù la testa | 1971.0 | PG | 157.0 | Drama, War, Western | 7.6 | A low-life bandit and an I.R.A. explosives expert rebel against the government and become heroes of the Mexican Revolution. | 77.0 | Sergio Leone | Rod Steiger | Rod Steiger | Rod Steiger | Rod Steiger | 30144 | 696690.0 | 0.0 | 2.6488 | 1125.0 | 0.0 |
| 990 | 990 | 991 | Kelly's Heroes | 1970.0 | GP | 144.0 | Adventure, Comedy, War | 7.6 | A group of U.S. soldiers sneaks across enemy lines to get their hands on a secret stash of Nazi treasure. | 50.0 | Brian G. Hutton | Clint Eastwood | Clint Eastwood | Clint Eastwood | Clint Eastwood | 45338 | 1378435.0 | 4000000.0 | 2.7786 | 756.0 | 5200000.0 |
| 991 | 991 | 992 | The Jungle Book | 1967.0 | U | 78.0 | Animation, Adventure, Family | 7.6 | Bagheera the Panther and Baloo the Bear have a difficult time trying to convince a boy to leave the jungle for human civilization. | 65.0 | Wolfgang Reitherman | Phil Harris | Phil Harris | Phil Harris | Phil Harris | 166409 | 141843612.0 | 4000000.0 | 7.3546 | 6428.0 | 378000000.0 |
| 992 | 992 | 993 | Blowup | 1966.0 | A | 111.0 | Drama, Mystery, Thriller | 7.6 | A fashion photographer unknowingly captures a death on film after following two lovers in a park. | 82.0 | Michelangelo Antonioni | David Hemmings | David Hemmings | David Hemmings | David Hemmings | 56513 | NaN | 1800000.0 | 1.7583 | 1300.0 | 0.0 |
| 993 | 993 | 994 | A Hard Day's Night | 1964.0 | U | 87.0 | Comedy, Music, Musical | 7.6 | Over two "typical" days in the life of The Beatles, the boys struggle to keep themselves and Sir Paul McCartney's mischievous grandfather in check while preparing for a live television performance. | 96.0 | Richard Lester | John Lennon | John Lennon | John Lennon | John Lennon | 40351 | 13780024.0 | 560000.0 | 1.9403 | 706.0 | 11000000.0 |
| 994 | 994 | 995 | Breakfast at Tiffany's | 1961.0 | A | 115.0 | Comedy, Drama, Romance | 7.6 | A young New York socialite becomes interested in a young man who has moved into her apartment building, but her past threatens to get in the way. | 76.0 | Blake Edwards | Audrey Hepburn | Audrey Hepburn | Audrey Hepburn | Audrey Hepburn | 166544 | NaN | 2500000.0 | 4.7021 | 4312.0 | 9500000.0 |
| 995 | 995 | 996 | Giant | 1956.0 | G | 201.0 | Drama, Western | 7.6 | Sprawling epic covering the life of a Texas cattle rancher and his family and associates. | 84.0 | George Stevens | Elizabeth Taylor | Elizabeth Taylor | Elizabeth Taylor | Elizabeth Taylor | 34075 | NaN | 5400000.0 | 2.8508 | 721.0 | 32855818.0 |
| 996 | 996 | 997 | From Here to Eternity | 1953.0 | Passed | 118.0 | Drama, Romance, War | 7.6 | In Hawaii in 1941, a private is cruelly punished for not boxing on his unit's team, while his captain's wife and second-in-command are falling in love. | 85.0 | Fred Zinnemann | Burt Lancaster | Burt Lancaster | Burt Lancaster | Burt Lancaster | 43374 | 30500000.0 | 1650000.0 | 3.0902 | 669.0 | 30500000.0 |
| 997 | 997 | 998 | Lifeboat | 1944.0 | NaN | 97.0 | Drama, War | 7.6 | Several survivors of a torpedoed merchant ship in World War II find themselves in the same lifeboat with one of the crew members of the U-boat that sank their ship. | 78.0 | Alfred Hitchcock | Tallulah Bankhead | Tallulah Bankhead | Tallulah Bankhead | Tallulah Bankhead | 26471 | NaN | 1590000.0 | 1.7674 | 455.0 | 1000000.0 |
| 998 | 998 | 999 | The 39 Steps | 1935.0 | NaN | 86.0 | Crime, Mystery, Thriller | 7.6 | A man in London tries to help a counter-espionage Agent. But when the Agent is killed, and the man stands accused, he must go on the run to save himself and stop a spy ring which is trying to steal top secret information. | 93.0 | Alfred Hitchcock | Robert Donat | Robert Donat | Robert Donat | Robert Donat | 51853 | NaN | 0.0 | 5.3971 | 995.0 | 0.0 |